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SPECIFICATION 
TITLE 

METHOD AND ARRANGEMENT FOR REDUCING A NUMBER ff' / 
OF MEASURED VALUES OF A TECHNICAL SYSTEM % /2y(?^ 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present inventive method relates to a method and an arrangement for 
reducing a number of measured values of a technical system. 



Description of the Related Art 

A technical system, for example a process engineering plant, supplies a large 
quantity of measured values per unit time, using different measured-value pickups 
(sensors). In the course of a number of days or weeks, a quantity of data accumulates 
which requires a correspondingly high computational power to process it. If the 
measured values are to be used to adapt or reset the technical system, operations are 
often necessary whose complexity admits only some of the measured values. It is then 
a great disadvantage to select a specific number of measured values arbitrarily from the 
quantity of all measured values and to process them further, since measured values 
with low significance, for example measured values with a high degree of interference, 
have a considerable influence on the overall result and falsify the latter considerably. 
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SUMMARY OF THE INVENTION 

The object of the invention is to specify a method and an arrangement for 
reducing measured values, it being ensured that the remaining measured values have 
a high significance with regard to their description of the technical system. In order to 
achieve the object, a method of reducing a number of measured values of a technical 
system is specified, in which the measured values are divided into classes in 
accordance with predefined criteria. The measured values in a class are assessed and 
those measu red values whose assessment lies below a predefined first threshold value 
are screened out. 

Screening out the measured values results in a reduction with regard to the 
number of measured values. A considerably reduced number of measured values is 
thus available for further processing. The further processing can be carried out with a 
lower computational outlay as compared with the non-reduced number of measured 
values. 

In one embodiment, the classes themselves are also assessed. In particular, a 
class whose assessment lies below a predefined second threshold value can be 
screened out. This results in an additional reduction in the number of measured values. 

In another embodiment, one criterion forthe division into classes consists in that, 
for each class, measured values relating to a predefinition of setting parameters of the 
technical system are determined. Typically, the technical system is set using a 
predefined number of setting parameters and after the setting has been carried out, 
there follows a (mostly time-delayed) reaction of the system to the setting parameters 



(transient response, transient process of the system). After the setting operation, a 
specific quantity of measured values which can be associated with the transient process 
are therefore picked up, measured values which are associated with the predefined set 
of setting parameters continuing to accumulate after the transient process has 
concluded (transition to steady-state operation). By adjusting the setting parameters, 
a new class is determined. All the measured values which respectively accumulate 
after an adjustment to the setting parameters belong in their own class. Another 
development consists in that measured values within a class which can be associated 
with the respective transient process are screened out. In addition, erroneous 
measured values can be screened out. In many cases, the setting of large technical 
systems depends on long-term, steady-state operation. Measured values which relate 
to the transient process (of short duration in comparison with the steady-state operation 
after the transient process has concluded) are expediently screened out, since 
measured values for the steady-state operation are falsified by them. In particular 
within the context of modeling the technical system, the measured data about the 
steady-state behavior of the technical system are of interest. 

One refinement of this embodiment consists in reducing the number of 
measured values in a class in that at least one representative value for the measured 
values in the class is determined. Such a representative value may be: 

a) an average (e.g. a sliding average) of the measured values of the class, 

b) a maximum value of the measured values of the class, 

c) a minimum value of the measured values of the class, 



d) a median. 

In the case of variant (d), one advantage resides in the fact that it is always 
possible to determine a value which is actually present, while the average (a) does not 
itself occur as a value. 

Depending on the application, a suitable selection for determining the 
representative value of a class may be made. 

An entire class with measured values can be screened out if said class contains 
less than a predefined number of measured values. 

Another refinement consists in that those measured values are screened out 
which differ from a predefinable value by more than a predefined threshold value. 

The predefinable value can be an average of all the measured values of the 
class or a measured value to be expected in response to the respective setting 
parameters of the technical system. 

A development consists in that the measured values which are reduced in 
number are used for the simulation and/or the draft design of this or another technical 
system. 

The draft design of the technical system can in this case comprise both 
adaptation and redrafting of this or another technical system. 

In addition, within the context of the simulation, the behavior of the technical 
system can be checked by using the reduced number of measured values, with the 
objective of altering the system for developing a new system with altered predefinitions. 



In addition, in orderto achieve the object, an arrangement for reducing a number 
of measured values of a technical system is specified which has a processor unit, which 
processor unit is set up in such a way that the measured values can be divided into 
classes in accordance with predefined criteria. Measured values in a class can be 
assessed, measured values whose assessment lies below a predefined first threshold 
value are screened out. 

This arrangement is particularly suitable for carrying out the method according 
to the invention or one of its developments explained above. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a block diagram which contains steps in a method for reducing 
measured values. 

Fig. 2 shows a schematic sketch of a recovery boiler. 

Figs. 3 through 5 show input variables, actuating variables and outward variables 
of the recovery boiler. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Fig. 1 shows a block diagram which contains steps in a method of reducing 
measured values. In a first step 101, the measured values are divided into classes. 
In particular, the division into classes is carried out with the effect that all the measured 
values which belong to one set of setting parameters are combined in one class. To 
this extent, each alteration to the setting parameters of the technical system found a 



new class. In particular, altering the setting parameters is associated with a transient 
process of the technical system, this process, as opposed to a steady-state behavior, 
containing extreme fluctuations in the measured values. In a step 102, individual 
measured values within one class are screened out. These may be, for example, 
erroneous measured values, that is to say measured values which exhibit a high 
deviation with respect to the other measured values or an average of the measured 
values or measured values from the transient process. There are a number of possible 
ways of screening out individual measured values within a class: 

1 . Measured values which are too poor (based on a predefined comparative 
value); 

2. Measured values which belong to a transient process; 

3. Measured values which are erroneous; 

4. Determining a representative measured value as a representative for a 
plurality of measured values, in that the representative measured value 
is determined as an average of the measured values in a class or as a 
maximum value or a minimum value of these measured values. 

Measured values of this type are preferably not taken into account; they are 
removed from the respective class. This results in a considerable reduction in the 
number of measured values. In a step 103, individual classes are screened out. One 
criterion for screening out an entire class consists in that the class contains less than 
a predefined number of measured values. In a step 104, the measured values reduced 




in number are used for further processing. Further processing is, in particular, a 
simulation and/or a draft design of the technical system. 

Fig. 2 shows a schematic sketch of a recovery boiler. In the following text, by 
using the example of a "recovery boiler 1 ', an exemplary embodiment of the method 
5 described above will be illustrated. 

In the paper and pulp industry, various chemicals and also heat and electrical 
power are needed for the digestion of pulp. With the aid of the recovery boiler, the 
chemicals used and additional thermal energy may be recovered from a thickened 
4 waste process liquor (black liquor). A measure of the recovery of the chemicals is of 
njo critical importance for the economy of the overall plant. 

£ The black liquor is burned in a char bed 201 . In the process, an alkaline melt is 

M produced and flows away via a line 202. In further process steps, the chemicals used 
O are recovered from the constituents of the alkaline melt. Heat of combustion which is 
~ released is used to generate steam. The combustion of the waste liquor and therefore 
jj5 the recovery of the chemicals begins with the atomization of the black liquor via 
atomizer nozzles 204 into a combustion chamber 203. As they fall through the hot flue 
gas, particles of the atomized black liquor are dried. The dried liquor particles fall onto 
the char bed 201, first combustion and chemical reduction taking place. Volatile 
constituents and reaction products pass into an oxidation zone, in which oxidizing 
20 reactions proceed and in which the combustion is completed. 
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Important objectives for the control of the recovery boiler are the steam 
production in order to obtain power, compliance with emission values from 
environmental points of view and the efficiency of the chemical reduction. 

The combustion operation, and therefore the objectives, are controlled in 
particular by the supply of air at three levels (primary air (PA), secondary air (SA), 
tertiary air (TA)). The overall process is subject to numerous influences, which have to 
be taken into account during the modeling: 

a) the measurement of the variables is subject to fluctuations which are 
often extreme; 

b) influencing variables which are not measured and cannot be measured 
exist; 

c) at each alteration to the settable parameters transient processes occur; 

d) the technical plant becomes soiled and is cleaned at predefined intervals, 
which has the effect of a drift over time in each case in the system 
behavior. 

The measured variables of the overall process are subdivided into input variables 
(cf. Fig. 3) and output variables (cf. Fig. 5). Measured variables are stored every 
minute. Four of the input variables are simultaneously also actuating variables (also: 
settable parameters; cf. Fig. 4). The actuating variables are to be viewed substantially 
as free parameters of the overall process which can be set independently of one 
another. Some of the other input variables are more or less dependent on the actuating 
variables. According to one predefinition, the variables "BL Front Pressure" and "BL 
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Back Pressure" are always to be regulated equally in the recovery boiler. The four 
actuating variables (cf. Fig. 4) are preferably to be stored as actuating variables (with 
the desired, preset value) and as input variables (with the measured, actual value). 
In the recovery boiler, one problem consists in the fact that, depending on the 
5 settable parameters, specific objectives, which are defined via measured variables, 
have to be met. Here, a three-stage procedure is selected in order to solve the 
problem: 

" 1 . The objectives to be considered are modeled by means of stochastic methods, 
jj these models being updated by means of new measurements (data-driven, 

JfcO empirical modeling). In this case, it is expedient to use not just a single model 

but global models for the identification of interesting areas in a parameter space 
g determined by the objectives, and to use local models for the exact calculation 

of optimum operating points. The models used are assessed by means of 
J~ quality measures. 



15 2. If the models considered are not sufficiently accurate because of the state of the 
data (quality measure), new operating points are deliberately evaluated in order 
to improve the model (experimental design). In addition, by using global 
stochastic optimization methods with regard to the objectives, attractive regions 
are identified on the basis of the current global model. 
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3. For the local optimization, local models are constructed, and the data sets which 
are available are, if appropriate, deliberately expanded (experimental design). 
The objectives are physical/technical or economic criteria which, as a rule, have 
to meet boundary conditions and/or safety conditions. It is often the case that a number 
5 of these criteria have to be considered at the same time. A stochastic model can be 
used in particular for the purpose of simulating the objectives to be optimized and their 
dependence on the parameters to be set in the computer. This is necessary when 
measurements are very costly or very time-consuming. In the case of safety 
" requirements, possible hazardous situations can be avoided. 

yhO In the case of the recovery boiler, on-line optimization, which is based on a 

£ plurality of items of data, is necessary, since the physical/chemical processes cannot 
M 8 be modeled quantitatively with sufficient accuracy and because the behavior of the 
2 plant is subject to fluctuations in the course of operation. The knowledge about this 
^ behavior must continually be expanded by means of the deliberate selection of new 
J5 operating points. Therefore, within the context of on-line optimization, the above- 
described three-stage procedure of stochastic modeling at mathematical optimization 
is to be recommended. 

DESCRIPTION OF THE INPUT VARIABLES 

The a input variables (a e N, N: set of natural numbers) generally depend on n 
20 actuating variables neN and on random effects. They can be described as follows: 
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Let (Q, S, P) be a probability space and B v be a Borel a-algebra over R v (R: set of real 
numbers) for each v e N. The input variables are represented by a projection cp which 
can be measured via B n x S - B a : 

cp: R n x Q - R a (1 ). The definition set of the projection 

cp is a cartesian product of two sets. If one considers the respective projections onto 
the individual sets, then the following projections are obtained: 

cp x : n R a , CO cp(x, o) for all x e R n (2), 
cp 10 : R n R a , x <p(x, co) for all o> € fi (3) . 



|<p x ; x g R n l 

1 ; is a stochastic process having an index set R n and a projection cp w is 

a path in this stochastic process for each event weQ. 

In the case of the recovery boiler, n=4 and a=14 (following the elimination of the 
variable "BL Back Pressure"). 

Because of the required ability to measure the projection cp x for each x e R n , the 
projection cp x is a random variable. Under suitable additional preconditions, expected 
values and higher moments can be considered. This access makes the step possible 
from stochastic models to deterministic optimization problems. In the case of a 
deterministic optimization problem, the target function can be set directly by means of 
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a variable, while the stochastic variable influences the target function but does not 
permit any deliberate setting. 



DESCRIPTION OF THE OUTPUT VARIABLES 

The process model M of the recovery boiler will be described as a function 
depending on the input variables and further random effects. In this case, let (Q, S, P) 
be the above probability space. The process model M is then a projection which can 
be 

measured by B a x S - B b : 

M : R a x H R b (4) , 

where b refers to the number of output variables. 

Since the recovery boiler is subject to a cyclic drift over time (from cleaning 
phase to cleaning phase), a description using a time parameter is also conceivable. 
The output variables may be represented by projections i that can be measured by B n 
x S - B b : 

v|/ : R n x Q. -> R b (5) , 

(x, to) -> M((p(x, co), co) (6) . 
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If the respective projections onto the individual sets of the definition set are 
considered, then the following projections are obtained 

v|/ x : n R b , co \\/(x, o>) for all x e R n (7), 
Vj/ 0 *: R n -» R b , x i|/(x, o>) for all (D € Q (8) . 



(v|/ x ; x e R n | 

; is a stochastic process having an index set R n , and the projection ip w 

is a path in this stochastic process for each weQ. 
In the recovery boiler, b=1 5. 

The fact that, when defining tp , no distinction is drawn between the events co 
used, does not mean that there is any restriction, since Q can be represented as a 
cartesian product of an Q1 and an Q2. The above representation therefore also 
comprises the model: 

V : R n x x n 2 ^ R b (9) ' 

(x, G>1, ©2) M(<p(x, G>l)# ©2) (10). 
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DESCRIPTION OF THE DATA SETS AVAILABLE 

Using the descriptions in the two preceding sections, it is possible to combine the 
input variables and the output variables together to form measured variables <t>. <t> is 
a projection that can be measured by B n x S - B m , where m = a + b, and 



0) : R n x n 



(11) , 



(x, co) 



W(x, ©), 



(12) 



If the respective projections onto the individual sets of the definition set are 
considered again, then the following projections are obtained: 

* X :D -> R m , co -» <t>(x, co) for all x e R n (13), 
O^iR 11 R m , x cj>( x , co) for all o) e n (14) . 



fa> x ; x e R n } 

Ms a stochastic process with an index set R n and the projection O w is a 
10 path in this stochastic process for each coeQ. 
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For each chosen tuple x of actuating variables, a large number of 
implementations of O x in the recovery boiler are determined and stored, that is to say 
for each Xj e R n , numerous implementations 

cpj k := 4>(xj, co jk ) <15> 
with cq j k € Q ; k = 1,2, . . . , Vj ; 

v-j e N ; j = 1,2, . . . , u ; u e N 



are considered. The stored data sets Djk of the recovery boiler are therefore (n + m) 
tuples: 



D jk 



k = 1,2, 



V-i 



j = 1/2, 



/ u 



(16) 



Here, D j1k1 is stored before D j2k2 if 

Oi < 32) v (Oi = 32) a (*i < k 2 )) 

DATA COMPRESSION BY DIVIDING THE CLASSES OF PARAMETERS 

Since, for each tuple x of actuating variables, there are generally a number of 
implementations of O x , because of the complex stochastic properties of the process to 
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be considered, the first step in the statistical data analysis is obviously to divide the 
classes of parameters by forming arithmetic averages. In addition, obviously erroneous 
data sets are separated out. An obviously erroneous data set is, for example, a 
physically impossible measurement which cannot possibly occur in real terms, in 
particular on the basis of a setting which has been made. 
Procedure: 

1 . Data sets in which the variable "BL Front Pressure" is not equal to the variable 
"BL Back Pressure" are screened out, since these two values must be equal 
according to the predefinition of the plant control system. The loss of data is 
very small. 

2. The data sets are divided into classes in which the four setting parameters (PA, 
SA, TA, BL Front Pressure, see above) are successively constant overtime, that 
is to say the jth class consists of the data sets D *. 

3. Classes in which there are fewer than 30 data sets are screened out, in order 
that transient processes do not have any great influence. 

4. For each class, an arithmetic average O j and an empirical standard deviation 
sj are determined for all the measured variables: 
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5. Classes in which the averages for the variables PA, SA, TA or BL Front Pressure 
are too far removed from the corresponding setting parameters are screened 
out. In these classes, therefore, the setting values could not be reached. 

CHARACTERISTIC STATISTICAL VARIABLES FOR THE GIVEN CLASSES AND 
THEIR GRAPHICAL REPRESENTATION 
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In addition to the arithmetic averages and the empirical standard deviations 
which have been determined for the individual classes, a common standard deviation 
s is further determined in accordance with 



s = 



V 



3 = 1 



u 



7TT' £b-i)* 

j = l 



1 

\ — 
2 



(19) 



Here, u stands for the number of classes (205 here) and v for the sum of v j( that 
is to say v is the number of all the measured values used (38,915 here). 

Although other modifications and changes may be suggested by those skilled 
in the art, it is the intention of the inventors to embody within the patent warranted 
hereon all changes and modifications as reasonably and properly come within the 
scope of their contribution to the art. 
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